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Abstract 

The Traveling Salesman Problem (TSP) is among the most famous NP-hard optimization 
problems. We design for this problem a randomized polynomial-time algorithm that computes 
a (1 + ^-approximation to the optimal tour, for any fixed e > 0, in TSP instances that form an 
arbitrary metric space with bounded intrinsic dimension. 



The celebrated results of Arora [ Aro98 | and Mitchell [ Mit99 prove that the above result 
holds in the special case of TSP in a fixed-dimensional Euclidean space. Thus, our algorithm 
demonstrates that the algorithmic tractability of metric TSP depends on the dimensionality of 
the space and not on its specific geometry. This result resolves a problem that has been open 
since the quasi-polynomial time algorithm of Talwar [Tal04 . 
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1 Introduction 



Among all NP-complete problems, the Traveling Salesman Problem (TSP) stands out as funda- 
mental and extensively studied. Indeed, numerous articles and even whole books ( [ |Rei94 , LLKS85| , 
GP02| , IABCC07H ) are devoted to TSP, studying various algorithms for different families of instances. 
In fact, some of the most basic techniques in combinatorial optimization were devised to tackle TSP, 
including for instance cutting planes. The input for (the optimization version of) TSP is a com- 
plete graph, whose vertex set we denote by S = [n], together with edge- weights w(-,-) that are 
nonnegative and symmetric,[] and the goal is to find a closed tour of S of minimum (total) weight, 
where a tour T is simply a permutation of S, i.e. it visits every vertex exactly once. 

A prominent special case of TSP, called metric TSP, is where the edge-weights satisfy the 
triangle inequality,^ and hence the input is simply a (finite) metric space on the point set S = [n]. 
The importance of this variant lies in the fact that edge-weights arising in many of the typical 
applications naturally represent lengths and distances. Metric TSP offers some basic structure that 
may be leveraged by algorithms. In particular, Christofides [Chr76] designed a 1.5-approximation, 
a polynomial-time algorithm that computes a tour whose weight exceeds the optimum by a factor of 
at most 1.5. It is a long-standing open problem to improve this approximation for metric TSP, but 
it is known that there exists a constant c > 1, for which c-approximation is NP-hard [ PY93| . |PV0G ]. 

and Mitchell [Mit9S] prove that the important special case 



Celebrated results of Arora | Aro 



of metric TSP where the input metric forms a Euclidean metric, admits a PTAS.^] To be more 
precise, these PTAS results apply to input metrics that are finite subsets of a fixed- dimensional 
Euclidean metric. Observing that these PTAS results require two separate conditions - Euclidean 
space and fixed dimensionality - it is only natural to ask: 

Question 1.1. Do TSP instances which satisfy only one of the bounded dimension and Euclidean 
metric properties admit a PTAS? 



The bounded-dimensionality requirement turns out to be necessary, as Trevisan | TreO0| 1 shows 
that TSP in Euclidean metrics (of dimension log n) is NP-hard to approximate within some constant 
c > 1. It is therefore not surprising that the running time of the aforementioned PTAS is doubly- 
exponential in the dimension. 

Eliminating the Euclidean requirement was first addressed by Talwar [Tal04]. Observe that 
a basic premise of this question is that the notion of dimension applies to an arbitrary (non- 
Euclidean) metric space. This is indeed possible, and Talwar relied on a definition put forth by 
Gupta, Krauthgamer and Lee [ GKL03| ] (following [Ass83, Cla99| ]): The doubling dimension of 
a (finite) metric space S, denoted ddim(S'), is the smallest k > such that every ball in the 
metric can be covered by 2 k balls of half the radius. This definition is essentially based on volume 
growth, and indeed simple volume estimates imply that a /c-dimensional Euclidean metric has 
doubling dimension Q(k). The opposite direction, however, is not true and in fact the family of 
metrics with bounded doubling dimension is significantly larger than that of bounded-dimensional 
Euclidean metrics (see [ LaaO0| , LP01, Laa02, GKL03| for details). Talwar [Tal04] generalized much 



of Arora's machinery [ Aro98] and showed that TSP in metrics with fixed doubling dimension admits 



1 Formally, w(x,y) = w{y,x) > for all x,y £ S. 

2 The triangle inequality says that w(x, y) < w(x, z) + w(z, y) for all x,y,z G S. 

3 PTAS, which stands for a Polynomial- Time Approximation Scheme, means that for every fixed e > there is a 
(1 + e)-approximation. Note that the runtime is polynomial in n for every constant e > 0. 
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a QPTAS.Q But despite repeated attempts, the original goal remained open: 



Question 1.2. Does TSP in metrics of bounded doubling dimension admit a PTAS? 



This question has fascinated researchers for several reasons (see e.g. | LcclO| ). First, the existence 
of a QPTAS may be interpreted as evidence that a PTAS is possible. Second, the above question 
accords well with a research program that was initiated in [|GKL03| , [KL04| , [Tal04|1 , and studies 
the analogy between Euclidean metrics of fixed dimension and general metrics of fixed doubling 
dimension, from the perspective of algorithmic tractability. Empirically, it has been observed that 
many algorithms dealing with the former family of metrics can be adapted to deal with the latter, 
see e.g. [ ACGP10| , GKK10 | and references therein for recent instantiations. 

A natural approach to resolving this problem in the positive would be to embed the original 



metric space in bounded dimensional Euclidean space (such embeddings were studied in [ ABN08 , 
ABNllfl ), and then apply the PTAS of [Aro98, Mit.99( |. While this general approach has been quite 



successful in resolving many other algorithmic problems (see for example [ Bar96| | ) , it fails here since 
any such embedding must have non-constant distortion ([ Laa00| , LP01, Laa02( |), in fact £2(\/log n) 
( |GKL03[ ) . It appears that achieving a PTAS for arbitrary bounded-dimensional metric spaces 
requires a new approach to bypass the limitations of the embedding. 



1.1 Results 

Our central contribution is a PTAS for TSP in metrics of fixed doubling dimension. 

Theorem 1.3. A (1 + e)- approximation to the optimal tour of a metric TSP instance S on n = \S\ 
points can be computed by a randomized algorithm in time n 2 ( ddim ( s )) . 2(2 ddim ( s >/£)° (ddim S) Vl°g»\ 

The previously known running time is is quasipolynomial in n, namely 2( ddim ( 5 )/ £ ' lo s n )° (ddim(S)) , 
due to Talwar | Tal04| , Theorem 8]. 



1.2 Techniques 



We build upon the framework of [ Aro98 , Tal04 1 , and introduce two main new ideas (and several 
more minor ones). Our framework is a variant of Talwar's algorithm (see Section ^) that includes: 
(1) a randomized hierarchical clustering (partitioning) of S; (2) the introduction of portals around 
every cluster; (3) slightly modifying the optimal tour (for sake of analysis only) so that the tour is 
portal-respecting (crosses every cluster only at its portals) and has few crossings into the cluster; 
(4) a dynamic program that computes a tour for each cluster based on the tours already computed 
for its subclusters. 



Our first new idea (in Section 1.5) is to estimate the cost incurred by an optimal tour inside 
a ball. Intuitively, the estimate is merely an instantiation of the well-known 2-approximation of 
TSP using the minimum spanning tree (stated as Lemma |L~6| ). But in reality, edges entering and 
exiting the ball interfere with this calculation, and thus the estimate includes both multiplicative 
and additive error terms. 

Our second new idea is to treat separately dense regions in the metric space, meaning balls 
in which an optimal tour incurs a relatively large cost. Now if all regions are sparse, then we are 



4 QPTAS, which stands for a Quasi-Polynomial Time Approximate Scheme, means that for every fixed e > there 
is a (1 + e)-approximation running in quasi-polynomial time 2 polylog ' n ). 
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almost done - in this case we use limited randomization to determine the hierarchical clustering. 
In particular, we draw at random O(logra) radii- values for every cluster center, and argue that 
with high probability at least one of them is useful for the construction of a good partition. We 
then augment the aforementioned dynamic program to search also over the different radii-values 
for every cluster center. (This appears in Section [O].) If there is a dense region, then we can use 
the first idea above to find the (nearly) smallest one. We then "split" the TSP instance into two 
portions, roughly the inside and the outside of that dense region. The outside is solved recursively. 
The inside portion is nearly sparse because it can be covered by a limited number of smaller (and 
thus sparse) balls, and so it can be solved immediately by the sparse algorithm. Stitching the 
solutions for the two portions may be costly, but since the region is dense, we can effectively charge 
our algorithm's cost to that of the optimum. (This appears in Section |3.2| .) 



1.3 Related work 



A few hardness of approximation results are known. That general (not necessarily metric) TSP 
is NP-hard follows immediately from Karp's original NP-hardness proof for Hamiltonian cycle 
|Kar72(| . Moreover, this proof shows that TSP does not admit any finite factor approximation in 
polynomial time, unless P=NP. Papadimitriou and Yannakakis |PY93] showed that metric TSP 
is hard to approximate within some constant factor c > 1, even if all the metric distances are 
either 1 or 2. Papadimitriou and Vempala ||PV06|| proved that approximating metric TSP within 
factor 220/219 is NP-hard. Papadimitriou [ [Pap77|| showed that two-dimensional Euclidean TSP is 
NP-hard. 



The runtime of Arora's algorithm [ Aro98[| was later improved in [RS98], and his geometric ap- 
proach was subsequently employed for other Euclidean problems in [CL98, ARR98. CLZ02, KR07]. 
Further extention of the algorithms of | Aro9S| , Tal04] to the problem of TSP with neighborhoods 
(under mild conditions) include ||Mit07 | and [CE11]. Chan and Gupta [ CG08| gave an algorithm 
for TSP that runs in sub-exponential time in metrics in which an alternative (more general) notion 
of dimension is assumed to be bounded. 



1.4 Preliminaries 

Recall our notation for the metric TSP instance: S denotes the set of points, d(-, •) their pairwise 
distances, ddim(5) its doubling dimension, and n = \S\ its size. We fix < e < 1/20, which 
determines the desired approximation to be l + 0(e). We may assume that e > 1/n, as otherwise all 
our results hold trivially — TSP can be solved exactly in time n! by straightforward enumeration, 
providing better approximation and faster runtime than our claimed runtime (which is at least 



exponential in 1/e). Similar to what was done by Arora [Aro98], by suitable scaling we may 
assume that the minimum interpoint distance in S is 1 and the diameter is 0(n/e) = 0(n 2 ). 

As usual, the metric may be viewed as the complete graph on S, with edge weights corresponding 
to pairwise distances, denoted w(x,y) = d(x,y). A set of points S' C S is sometimes called 
a cluster. We let MST(5") denote a minimum spanning tree (breaking ties arbitrarily) of the 

complete graph on S' . The ball centered at x G S with radius R > is defined as B(x, R) d = {y £ 
S : d(x, y) < R}. We define B*(x, R) to be the edges of the complete graph on B(x, R). 



Tours. Throughout, a tour is a finite sequence of points; by convention, it is undirected. A 
transition in the tour is a pair of successive points in the sequence, which may be viewed as an edge 
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in the complete graph on S. A closed tour is defined in the natural way by adding a transition 
between its two endpoints (and then no point is an endpoint in the tour). 

The weight of a multiset M of transitions is defined as w(M) = f ^/ y ^^j w(x, y). This notation 
naturally extends to a tour T, by viewing T as sequence of transitions, and so w(T) just denotes 
the (total) length of the tour T. 

Let OPT(S") denote a minimum length closed tour that visits all points of S' C S. 

Lemma 1.4. If a tour T traverses an edge e more than twice, then there exists a lighter tour T' 
that is a subset of the transitions of T. 

Proof. We will prove the case of three traversals, and a similar proof holds for more. Suppose that 
T traverses some edge e = (it, v) exactly thrice. Then an ordering of the edges of T must take the 
form E\eEieE^eE/±, where each Ei is a (possibly empty) set of edges, and e may be traversed in 
either direction. If E2 or E3 are empty, then the tour is of the form E\eeE^eE^ or ExeE^eeE^ and 
the segment ee may be deleted. Now, if E2 is a cycle beginning and ending at u (or v), then one 
of EieeE^E2eE^ or E\eeE-ieEiE/± is a valid reordering of T, and the segment ee may be deleted, 
resulting in a lighter tour. Assume then that E2 begins at u and ends at v, and trivially E3 also 
begins at u and ends at v. Then the segment E^eE^e which begins and ends at u may be replaced 
by a path that follows E2 and then traverses the edges of £3 in a backwards order, thereby skipping 
e twice. □ 



Doubling dimension. Let \s > be the doubling constant of the metric S, the smallest value 
such that every ball in S can be covered by \$ balls of half the radius. Recall that the doubling 
dimension of S is ddim(5) = log 2 Ag > 1 (assuming \S\ > 2). The following packing property can 
be demonstrated via repeated applications of the doubling property (see e.g. | GKL03|| ). 



Lemma 1.5. (Packing Property) Let S' C 5 have minimum interpoint distance a > 0. Then 

^ / 2diam(g'A ddim ( g ) 



j'| < ^ 2_dis 

Notice that whenever dlSLn ^ s ) > 2, we can further bound \S'\ < ( dmv ^- s ) \ 



Nets. Similar to what was described in |GGN06 , KL04], a subset S' C S is an (a, b)-net of <S (for 



< a < 6) if it satisfies the following two properties. 

(i) . Packing: For every u,v £ S we have d(u,v) > a. 

(ii) . Covering: Every v G 5 is within distance 6 of some point it G 5', i.e., S C U u& s'B(u,b). 

We say that u £ S' covers v G 5 if (i(n, u) < 6. The previous conditions require that the points of 
<S" be spaced out, yet nevertheless cover all points of 5. If a = 6, we may refer to the (a, 6)-net as 
an a- net. 

Hierarchy of nets (or point hierarchies). Set L ^ [log s diam(S')] = 0(log s n), for a parame- 
ter s > 4. (In Section || we will require s roughly (log n ) 1 / ddim ( s ') - ) p or each i = 0, . . . , L, fix Hi C S 
to be an s*-net of S, called the i-level net, or the s*-scale net. Using a simple greedy construction, 
we may assume that Hi C Hi-±. Notice that the bottom level i = contains all points, and the 
top level i = L contains only a single point. 
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Spanning trees, tours, and patching. It is well-known that the optimal tour on a set S' is 
approximated within factor 2 by the minimum spanning tree on S'. 

Lemma 1.6. Let S' C S. Then io(MST(5')) < w(OPT(S')) < 2w(MST(S")). 

The following lemma, due to Talwar |fTalQ4| | (see also BSmiloll ), uses the doubling dimension to 
bound u;(MST(S)). 

Lemma 1.7. Let S' C S. Then io(MST(5')) < 4|S'| 1 ~ 1/ddim(5) • diam(S"). 

The following lemma, due to |Aro98| , [Tal04|| , is known as the Patching Lemma for doubling 
spaces. We say that a transition (x, y) in a tour T crosses a cluster C C S if exactly one of x, y 
belongs to C. The point (among x,y) that belongs to C is called a cross-point. 

Lemma 1.8 (Patching Lemma). Let T be a tour that crosses a cluster C r times. Then the number 
of crossings can be reduced to two, at an additional cost of at most 4 times the minimum spanning 
tree ofT's cross-points (denoted C). That is, there is a tour T' that crosses C at most twice and 
(plugging in Lemma 

w{T') < w(T) + 4w(MST(C)) < w(T) + iQr 1 "^^ diam(C). 

We also present the following lemma, which is a version of the Patching Lemma tailored to our 
specfic needs. 

Lemma 1.9. Let T be a tour of S, and C C S be a cluster. Consider the tour segments Tj 
i = 1, . . . , k that result from removing from T edges not found in the full graph on C , and let C be 
the cross-points of C. Then there exists a closed tour T' on C which contains only edges in Ti and 
MST(C) and 

k k 

w{T') < 4w{MST(C)) + J2 w ( T i) ^ lOr 1 " 33 ^) diam(C) + ^w(Ti). 

i=l i=l 

Let C C S, and let C be the cross-points of C. We say that a tour T exits C if it transitions 
from a point of C to a point of C (the exit point) and then to a point outside the cluster. Similarly, 
we say that a tour T enters C if the tour transitions from a cross-point in C (the entry point) to 
another point of C, after having immediately arrived at the cross-point from a point outside C. 

Lemma 1.10. Given any tour T, there exists a tour T' which is a subset of the transitions of T, 
such that T' enters and exits all disjoint clusters at most twice at each cross-point. 



Proof. The proof is an easy application of Lemma 1.4. For each cluster C, represent each crosspoint 



x as two points x\ and X2 with infmtesimally small distance between them. Let edges connecting 
x to points outside C be incident on x\, and edges connecting x to points of C be incident on X2- 
Then edge e = (xi,X2) is traversed at most twice. □ 



5 



Net-Respecting Tours. A tour is said to be net respecting (NR) if every transition in it, say 
of length £, has both of its endpoints belonging to every net Hi with s l < e£. (This definition is 
with respect to a given hierarchy {Hi} and e > 0.) Since T~L is a hierarchy, it actually suffices for 
the endpoints to belong to Hi for the maximum i such that s i < el.) We denote by OPT NR {S') 
an optimal (minimum length) net-respecting tour that visits all points of S' C S. 

Lemma 1.11. Every tour T can be converted to a net-respecting tour T' such that w(T') < (1 + 
6e)w(T) and T' visits all points visited by T. 

Proof. Simply replace a transition (x,y) with (x',y'), where x',y' are the i-net points covering 
x, y respectively (where level i is the highest level such that s l < |d(x,y)), and connect x to x' 
(and similarly y to y') via a sequence of net points (from levels j = i — 1, i — 2, . . .) covering 
x. Clearly d(x',y') < (1 + e)d(x,y), and the path from x to x' (or y to y') has length at most 
Ej-<i^ < 2s ' i <2ed(x,y). □ 

1.5 Local behavior of optimal tour 

We next show that the weight of the optimal (net-respecting) tour inside some neighborhood can 
be approximated using a minimum spanning tree of points in that neighborhood. 

Lemma 1.12. Let OPT NR (S) be an optimal net-respecting tour visiting all points in S. Then 

(i) . w{OVT NR (S) n B*(u, R)) < 6(1 + 6e) • w(MST(B(u, R))). 

(ii) . w(MST(B(u, R))) < w(OPT NR (S) D B*(u,4R)) + (s/e) 2ddim ^ R. 

Proof of Lemma Assume by contradiction that does not hold. Applying the Patching 

Lemma to the tour OPT NR (S) with respect to the cluster B(u,R), we get a modified tour which 
visits all of S and crosses that cluster at most twice, while increasing the tour's length by at most 
4MST(i?(u, R)). Now replace the portion of this tour between these unique cross-points with a 



tour that is derived from an MST of B(u,R) (Lemma 1.7), and thus adds total length of at most 
2w(MST(B(u, R))). Convert the resulting tour to be net-respecting (Lemma 1.11] ); this step clearly 
affects only the newly-added edges. This entire process first removes from the tour a total length of 
at least w{OPT NR (S) n B* (u, R)), then adds a total length of at most 6(1 + 6e) • w(MST(B(u, R))). 
But since [ij does not hold, it means the overall tour length has strictly decreased, which contradicts 
the optimality of OPT NR (S). 

To prove part (ii) , consider a tour OPT NR (S). It can be partitioned into subtours T\, T%, T3, . . ., 



where for odd k contains only points in B(u,R) and is maximal with respect to containment. 
Thus, Tf, for even k has its endpoints inside B(u, R), overlapping Tk_x and T^ +1 , and the remaining 
points in Tk-i and Tj~+i are outside B{u,R). By definition, w{OPT NR {S)) = J2k w ( T k)- 

We will now construct a connected graph that spans B(u,R), and use a charging argument to 
bound its weight. First, take Ti,T^,T^, . . .; we can pay for them using their own contribution to 
OPT (S) H B*(u,4:R). Now consider any T^k- If this T2k visits only points inside B(u,4R), then 
again take T^k itself, paying for it using its own contribution to OPT NR (S)nB*(u,4R). Otherwise, 
we know that Tik crosses the ball B(u,4R), and obviously this happens (at least) twice. If one 
of the two corresponding cross-points is inside B(u,4R) \ B(u,3R), then this T^k contributes to 
OPT {S) n B*(u,4R) at least 2R, which suffices to pay for connecting T2k-\ and T^k+i with a 
direct edge. 
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The final case is when T<ik crosses the ball B(u,4R), obviously (at least) twice, and in both 
crossings the cross-point is inside B(u,3R), implying that this transition has length at least R. 
Let i be the largest value for which s l < eR. By the net-respecting property, the cross-point must 
belong to an s l -net. Although we may have many such subtours T2k, the number of distinct s*-net 
points inside B{u,3R) is at most (J^) ddim(5) < ^(s/e) 2ddim ( s \ and we then connect all these net 

points to an arbitrary point, say in T±, at a cost of at most (s/e) 2ddiD1 ^ • R. We also take the 
portion of that goes until these cross-points (which are also the said net-points), paying for 
this portion using T2fc's own contribution to OPT NR (S) n B*(u,4R). Overall, the edges that we 
take are easily seen to form a connected subgraph that spans all vertices of B(u,R) and has total 
weight at most w(OPT NR {S) n B*(u, 4R)) + {s / e) 2 ddim( - s) R; thus, the weight of the MST on these 
points cannot be larger. □ 

2 TSP via hierarchical clustering (Arora and Talwar) 

As an exposition to our PTAS, we review a variant of the algorithm of Talwar [ Tal04f| (and in 



turn Arora Aro98| ) , which uses hierarchical clustering to compute a (1 + e)-approximate tour 



in quasi-polynomial time. Recall that we may assume (by arguments found in [Aro98, Section 
2.1.1]) that the instance of TSP is a set S with minimum interpoint distance 1 and diameter 
0(n/e) = 0(n 2 ). The construction uses a hierarchy of nets as described above. We first introduce 
the single-scale partition invoked by the algorithm. This partition follows the same framework used 
in [Bar96, Bar98, FRTOSj , GKL03, ABN11], and is slightly different from the one that appeared in 



[ Tal04 1 in that it uses the exponential distribution. 



Single-scale probabilistic partition. Fix a set S' C S to be partitioned. Fix a level i, and 
impose an arbitrary ordering ir on the points of the s*-net Hi C S. The clusters are formed one by 
one following the ordering n. Each point of Hi constitutes a cluster center. With each net-point 
u £ Hi we associate a random radius h u £ [s*,2s 4 ] from an exponential distribution.^ The ball 
B(u, h u ) constitutes a new cluster of S", which is removed and then the process continues to form 
the rest of the clusters. The boundary of u's cluster (i.e. edges that cross the cluster) is determined 
only by the ordering imposed by ir, and by the balls associated with cluster centers at distance at 
most 4s* from u. By the packing property, there are at most 2 4ddim ( s ) such cluster centers. 



The next claim follows from [ ABN11]. 



Claim 2.1. For every u,v £ S' C S, the probability that the single-scale probabilistic partition 
assigns u and v to different clusters (they are cut) is at most c ddim ^) ci ( M ' tl ) j or some absolute 
constant d > 0. 

Hierarchical clustering. To create the clustering, we first choose a single-scale partition for the 
top level L. As described above, each net point chooses a radius in the range [s L ,2s L ], and then 
every point in S is assigned to the earliest ball that covers it. For the next hierarchical level L — 1, 
we take each L-level cluster separately, and build for its points a new partition with random radius 
in the range [s i_1 , 2s L ~ 1 ]. (We may assume for simplicity that the partition employed inside each 



3 The density function of the distribution can take the form: — 2 ^_ s ddi ( m ( S) • 81n2dl ^' m ( s ) .2 •« r , where r € 



[s\ 2s l ], and for all other values of r (see ABN11|) 
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L-level cluster is the same, i.e. it uses the same ordering and random radii, although the analysis 
does not require this assumption.) The construction continues recursively until level 0, the bottom 
level. Note that by the packing property (Lemma 1.5), each cluster has at most s 2ddim ( 5 ') child 
clusters. 

The probability that u and v are cut at level i (found in different i-level clusters) is bounded by 
the sum of the probabilities that they are cut in any level i or higher, that is X^j=i c ddim (S)d{u,v) _ 

TSP algorithm and analysis. The dynamic programming TSP algorithm functions on the 
hierarchical clustering above. A tour is (m, r)-light with respect to the hierarchical partition if it 
crosses each i-level cluster at most r times, and only at xf-net points, the cluster portals, where m 
is an upper bound on the number of portals. We choose s > 4, and let M be the smallest power of 
s that is greater or equal to ddim( f logw , so m < Af2ddim(S) < ^ddim(S)logn ^ 2ddim ( g > j 

An optimal (m, r)-light tour for the hierarchical clustering can be computed by dynamic pro- 
gramming as follows: Consider a cluster C. Any valid (m, r)-light tour crosses C at most r times 
and only at portals, so it consists of r paths starting and ending at portals. A configuration is a 
multiset of r or fewer portals partitioned into pairs (each representing an entry /exit pair). A single 
portal may appear more than once in the configuration if the tour crosses it multiple times, but 
each instance counts towards r. A cluster has m portals, so there are no more than m r possible 
configurations. Now, if optimal (m, r)-light tours have been inductively computed for the (at most) 
s 2ddim(S) chiidpgn f (j un der the hierarchical clustering, the optimal (m,r )-light tour for C can 
be computed by a brute-force algorithm: Since the (m, r)-light tour of each child cluster enters 
and exits via a portal, we can "stitch" together the child tours through the child portals. For 
each fixed configuration of C (at most m r possible configurations), we consider all possible child 
configurations (at most m s2ddim r ). Having fixed a configuration for every child cluster, we have at 
most s 2ddirn ( s ) r candidate child portals where the tour may cross. Since each child portal may be 
connected to one of at most s 2ddlia ( s ) r other candidate child portals, all possible graphs connecting 
these portals can be enumerated in time bounded by ( s 2ddim ( s ') r ^ s r < r s 4ddimS r_ Then we 

consider all graphs that support a tour connecting all child portals to portals of C; we choose the 
graph with the least cost tour. The total runtime is (mr) s ° ( 



Crucially, it follows from [Tal04] that with constant probability, the hierarchical clustering 
for S admits an (m,r)-light tour with weight at most (1 + e) OPT(5), for s = 4, and m,r = f 
^ ddim(5)iog 3 ra ^j ^ The proof proceeds as in [Aro98|, by showing that an optimal tour can be 



slightly modified to observe this property. The cost of modifying the tour is charged to the tour's 
edges, and the analysis shows that the cost charged to each edge is small. Briefly, the probability 
that an edge e = (u, v) is cut by the i-level partition is bounded by c d ( u < v ) ddmi ( s ) _ -^y e then move 
this edge to be incident on a -ff-net point, at an additive cost (increase in tour length) of ^j. 
Hence, the expected cost of moving e due to a cut at level i is 6 d(u,v) ddim(S) 2s^ _ q ( ed(u,v) 



M \ log s n 

and the expected cost of moving e due to a cut in any of O (log s n) levels is 0(e ■ d(u, v)). Now, if 
the optimal tour crosses an i-level cluster some r Q ^ r times, the tour is patched via the minimum 
spanning tree on the cross points (via Lemma |L^) . The cost is charged to the edges participating in 

/ i l-l/ddim(S) \ / j \ 

the patching, at a per edge cost of O ( ^-^ — J = O f ^ — J . But an edge participates 
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in a patching only if it is cut, which happens with the above probability, hence the expected charged 
cost to e due to patchings at one level is O ^ rf ("^) ddim ( g ) _ __^e__ s j = q ^ £ffe^) j ^ an( j ^ ue to 

patchings at all levels is 0(e ■ d(u, v)). The values for m and r imply that the algorithm above runs 
in quasi-polynomial time. 



3 Obtaining a PTAS 



In this section, we prove Theorem 1.3, the central contribution of this paper. 



Our algorithm for TSP again mimics the one employed by Arora [Aro98|. His algorithm requires 



a hierarchical partition, yet we cannot directly employ the partition of Section g. That clustering 
essentially decides the cluster assignment for each level separately, and hence it cannot successfully 
invoke the analysis of | Aro98|| to bound the expected cost of patchings per level. More precisely, the 



event that edge e is cut by an i-level single-scale partition, and by no other single-scale partition, 
is still G ( ddl ™^ ^ . Hence, the expected cost of participating in an z-level patching is mostly 
independent of the expected cost of participating in a j-level patching for all i ^ j, and so a term of 
L = 0(log s n) must appear in r. This is precisely the reason why the analysis presented by Talwar 
| Tal04d does not achieve a PTAS for metric TSP. 



Instead, we will employ a modified version of the above partition, and analyze its performance 
on net-respecting tours. We will show that if a tour obeys some edge-sparsity property, then it 
admits an (m, r)-light tour on a hierarchy very similar to the one above. Crucially, the edge-sparsity 
property allows us to achieve r = 0((logn) c ) for a small constant c < 1, which implies a polynomial 
runtime. (Although we fix the value of c in the analysis, it can in fact be taken as an arbitrarily 
small constant.) This partition can be found by a "brute- force" version of the above dynamic 
programming algorithm. We then show that if the tour has an edge-dense area, then the point set 
S can be broken into two pieces, and TSP solved separately on each. 

While a regular optimal tour need cross a point only twice, in an optimal net-respecting tour a 
net-point may have many edges incident upon it. Because of this, we will slightly modify the term 
(m, r)-light to mean that tour exits or enters the cluster at most r times at the portals, although 



there may be many crossings incident on a portal. Recall from Lemma 1.10 that we may assume 



that a tour enters or exits the internal cluster points at most twice via a single portal. 



3.1 An algorithm for sparse tours 

A tour T is said to be q-sparse with respect to a net-respecting hierarchy Hi, . . . ,Hl if for all 
i G [L] and u S Hi, the edges of T inside the ball B(u,3s l ) have weight w(T f] B*(u,3s 1 )) < qs % . 
The last inequality will be referred to as g-sparsity of that ball. 

Suppose that an oracle had informed us that S admits a net-respecting tour that is a (1 + e)- 
approximation to OPT(5) and is g-sparse. (An oracle with a similar capability is presented in 
Section |]| for q = ( s / e )°( ddim ( s ')) • 2°( ddim2 ( 5 )). Nevertheless, the following lemmas are stated for 
general q.) Then we can prove the following lemma. 

Lemma 3.1. Suppose S admits a net-respecting q-sparse tour T. Then there exists a hierarchi- 
cal clustering for S which admits an (m,r) -light tour T' with w(T') < (1 + e)w(T) for m : = 
(sddim(5)/e • log s n) 2ddim(5) and r = r{q) := 10 • 2 4ddim ( s )glog s logn + (2c' ddim(5)/e) ddim(S) + 

( s / £ )2ddim(S)_ 
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We remark that the tour T" need not be net-respecting. 



Proof. Fix T. The hierarchical clustering closely follows the description from Section ||, with the 
only difference being that the cluster radii are chosen a little more carefully. Consider a net-point 
u € Hj. Because of the g-sparsity of the ball B(u, 3s- 7 ), the weight of edges in T that have length at 
most s 3 and at least one endpoint inside B(u, 2s 3 ) is at most qs 3 . Recall that we wish to assign u a 
random radius h u E [s 3 ,2 s 3 ]. Let V be the set of values which cut fewer than lOq ddim(S') edges of 
T of length at most s 3 . A simple averaging calculation shows that at least a fraction 1 — 10(M im(,g) °f 
radii in [s 3 , 2s 3 ] belong to V (fraction here means with respect to uniform distribution). We choose 
h u randomly from an exponential distribution on [s J ,2s J ] |ABN11], and resample until finding a 
h u G V. Note that the exponential distribution implies that the probability that a sampled radius 
belongs to V is at least 1 /2 (this can be easily seen by considering the "worst-case" scenario where 
V C [(1 + 10dd j m (g) )s 3 \ 2s 3 ]). Then the clustering is done exactly as before (iterating over centers 
etc.), but clearly the actual number of edges cut can only be smaller (because of other balls cut 
earlier in the same level or at a higher level). Note that knowledge of T was necessary only to 
determine which radii are valid choices for h u . 

We now analyze the expected cost of converting the tour T to be (m, r)-light with respect to 
this hierarchical clustering. We first consider the cost of forcing the tour to cross every cluster only 
through its m cluster portals. The probability that an edge e = (u, v ) G T is cut by the i-level 
partition is bounded by 2c d ( u < v ) ddim(g) probability that the edge is cut conditioned on choosing 
a valid radius). We then move this edge to be incident on a jj-net point, where M is the smallest 
power of s at least ddim (^) lo Ss " _ phe cos t Q f this modification is < So the expected cost of 
moving e due to a cut in level i is at most 2c d (u,v) ddim(g) _ 2s < q ( ed(u,v) \ ^ an( ^ ^ e expected cost 



s l M — y l°g s n 

of moving e due to cuts in all L = O (log s n) levels is 0(ed(u, v)). The number of portals is at most 

( 12s' \ ddim ( g ) . / l2sddim(5)log s n \ ddim ( S ) , , n pr^is 

m := I *A M 1 < I ^ — ^— I by the packing property (Lemma [Lap. 

We turn to the analysis of reducing the number of entrance and exit points to r via patching. 
Consider some i-level cluster C. We define an edge to be long with respect to level i if its length 
is at least s l , and otherwise we consider it short. We break the analysis into two parts, and bound 
the number of crossings due to long and short edges separately. It turns out that long edges don't 
figure into the patchings. 

We analyze short edges first. Recall that in the construction of the clustering, each j-level ball 
cuts at most lOgddim(iS) edges of length at most s 3 . Further, edges crossing our cluster C could 
have actually been cut by any of at most 2 3ddim ( s ) neighboring balls at any level j > i (because 
there are at most 2 3ddim (' s ) j-level balls within distance As 3 from C's center). Therefore the number 
of short edges of length at most s l crossed by C due to j-level ball cuts (for any j > i) is at most 
2 3ddim(S) ddim ( 5 ) g _ get r , = r /( g ) . = max { 2 () . 2 3ddim ( 5 ^ddim(5)log s logn, ^ 2c ' d d im (g) ^ ddim(5) } 

and consider the case where C cuts more than r' short edges. Then at least r'/2 of the short edges 
crossing C must have been cut by balls in levels j' >i + 21og s logn, and we can charge a patching 
at level i only to these short edges. We then see that a short edge cut by a j'-level ball is only 
charged for patchings that occur in levels i < f — log s logn. 

Now, if more than r' short edges cross C, the tour is patched via the minimum spanning tree (a 

I — -1, / i /l-l/ddim(S) \ / j \ 

la Lemma |L£|) , at a per edge cost of O ( — — J = O ( dd f m £ (g) J • Recall though that the edges 

charged for this patching are edges cut by balls at levels % + log s log n or higher. It follows that the 
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expected cost to edge (u, v) due to a patching at level i is O y ^+l^fl™^ • 335^5) ) = y iog'n ) > 
and due to patchings at all levels is 0(ed(u, v)). This concludes the analysis for the short edges. 

We will now argue that the number of long edges entering and exiting C is bounded by 
(s/ / £) ddim ( s ^, and this completes the proof of the lemma. Since the tour T is net-respecting, each 
long edge enters C at an s^-net point such that es l_1 < s e < es l . Therefore the long edges en- 
ter or exit C at one of (8s/e) ddim ^^ candidate portals, and by Lemma 1.1C there are at most 



2(8s/e) ddim ( s ) < (s/e) 2ddim ( s ) long edges entering or exiting C. Therefore the total number of 
entry and exit portals is at most r := r' + (s/e) 2ddim ^ 5 \ and so the long edges do not figure into 
the patchings. □ 

Let s = (log n) 1 /^" ddim ( ,s ')) for some constant c" > 32. We can now provide an efficient algorithm 
to find a tour as promised in the last lemma. 

Lemma 3.2. If S admits a net-respecting q-sparse tourT, then there exists a randomized algorithm 
that, with constant probability, finds a tour T' with w(T') < (1 + e)w(T) in time n O(2 4ddim ' s ') . 

20(glogg(ddim(S)/e) 4ddim ( s ) ty\ogn) 



Proof. If we could compute a hierarchical clustering that satisfies Lemma 3T , then the standard 



dynamic program from Section [2] would give a tour for S satisfying Lemma 3.2. However, we cannot 



compute this hierarchical clustering, since we do not have access to T and cannot know which radii 
are valid choices for h u . Instead, we present a dynamic program that guesses the proper value of 
h u . Since only a fraction of xoddim(g) °^ ^ ne can didate values for h u are invalid, recall that the 



exponential distribution of [ABN11] implies that a random guess for the value of h u is a valid value 
with probability at least 1/2. Hence, O(logre) independent random choices ensure that at least one 
choice for h u is valid with probability 1 — \, which by a union bound implies that with constant 
probability, for each net-point at least one of the O(logn) choices is valid. 

The dynamic program guesses a radius for each net-point (i.e., it tries all the O(logn) random 
choices made above) and builds the tour in a bottom-up fashion, from level 1 to level L. Recall 
that s = (logn) 1//c ddim ( s '). It follows that the number of levels in the hierarchy is L = 0(log s n) = 

( dd 'iii^iog ^ g n ) " Suppose by induction that the dynamic program for computing a tour at level 

1 — 1 has been completed. We show how to compute level i. 

Consider some i-level cluster C centered at u G Hi. C is formed by cuts from neighboring 
balls in levels above i, and we wish to enumerate all possible formations of C: Recall that we 
make O(logn) random choices for h u G [s l , 2s*]. Further, since u is within distance 4s* of 2 3ddim (' s ) 
other s l -net points whose radii may cut C, and we guess O(logn) radii for each of these net- 
point, C may be cut in (0(log n )) 23ddim(S) different ways in this level. Since C may be cut from 
above in all L — i levels, it follows that the number of possible formations for C is bounded by 
(0(log n)) 23ddim(S)i = n °(2 4ddim S) ). For each fixed formation, we must calculate the minimum 
(m,r)-light tour for each possible choice of r exit portals of C (m r possibilities). C has at most 
s 2ddim(S') ghjjd clusters, and so there are (0(logra)) 23ddim(S ' s2dd ' m(S) possible child formations. (Note 
that since the radii of higher level balls has already been fixed, we need not consider all ways that 
these higher level balls can cut into C's children.). For each fixed child formation, we consider 
each portal configuration for the set of children (( m r ) s2ddim(s:i possibilities). We then compute the 
cost of connecting the child portals to the r exit portals of C: This can be done by enumerating 

all graphs with one edge on each vertex, in at most (^rs ml - J different ways. We can 
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bound each of the expressions above by: 



0(mr log n ys 2dd[m{S) _ 20(glogg(ddim(5)/e) 4ddim ( s )s 4ddim ( s >(loglogn) 2 ) _ 20(glog<j(ddim(5)/e) 4ddim ( s )) ^lof^) 
Lemma |3.2| follows. □ 
Comment. For use in Section 0, we note that the previous construction gives an alternate (per- 



haps stronger) guarantee to that of Lemma 3.2. Let a /c-tour of S be a set of k open tours that 



amongst them visit all points of S. The best /c-tour is denoted as kOPT(S'). The term (m, r)-light 
with respect to /c-tours is unchanged; each cluster may only be entered or exited via the r portals, 
once per portal. Similar, a g-sparse /c-tour is one in which each i-level ball B(u, 3s*) covers at most 
qs 1 edges of the /c-tour. Then the following lemma holds. 

Lemma 3.3. // S admits a net-respecting q-sparse k-tour T, then there exists a randomized 

algorithm that with constant probability finds a k-tour T' with w(T') < (1 + e)w(T) in time 
0(2 4ddim < s >) . 20(glogg(ddim(S)/e) 4ddim ( s > ^logn) 

3.2 Eliminating dense areas 



Lemmas 3.1 and 3.2 show that sparse tours admit efficient hierarchical decompositions and algo- 
rithms. Here, we consider tours that have dense neighborhoods, and show that the point set can 
be divided into areas with all light tours. We then solve TSP on each subset, and join the resulting 

subtours into a single tour. 

g et q . = ( s / £ )0(ddim(S)) . 2 0(ddim 2 (5))_ 

Lemma 3.4. There is a (randomized) polynomial-time algorithm that given a set S (with \S\ > 1), 
computes two subsets S\ C S and S2 C S with S\ U S2 = S and S± n S2 ^ 0, such that 

(a) . OPT NR (S\) is q' -sparse, for q' = 0(q-\/log n) ; and 

(b) . w(OPT NR {S l ))+w{OPT NR {S 2 )) < w{OPT NR (S)) + ew{OPT NR (S 1 )). 

Proof. The algorithm begins by locating the lowest level i for which there exists u £ S such that 
w(MST(B(u,3s 1 )) > 2qs l , and setting v to be such that w(MST(B(v,3s 1 ))) is maximized. If no 



such i exists, then we conclude by Lemma |1.12|(i)| that OPT NR (S) is 13<? -sparse, and our lemma 



is trivial: Set Si = S and S2 includes an arbitrary single point. Assume then that i exists, and 
further that v has been found. Let q* := w(MST(B(v, 3s*))/(2s J ), and it follows that q* > q. 

Fix a tour T = OPT NR {S). Ideally, we would now like to choose some radius h, partition S 
into two point sets S± = B(v,h) and S2 = S\S±, and then show tours for the two sets, whose 
combined weight is only slightly greater than that of T. Let S* denote the edges of the complete 
graph on the points of Si (for i = 1,2); then we could bound the weight of the subtours by showing 
that Tj = S* OT (for i = 1,2) can each be made into a closed tour by adding only a light-weight 
collection of edges to "patch" the edges of T cut by the partition (as in Lemma |1.9| ). However, 
this plan may be costly because a radius h ball can cut many edges of T which then need to be 
patched. Moreover, in order to ensure that the subtours are net-respecting, we need to augment 
the sets Si with appropriate (nearby) net points. To solve this problem, we will show how to create 
sets Si D Si (which contains copies of some points of S2) and 52 D S2 (which contains copies of 
some points of Si) for which the lemma holds. 
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Below, we will choose (deterministically) a value h £ [12s*, 13s*]. Assume that h has been 
chosen, and let us focus on the long edges of T crossing Si, those of length more than <5s* for 
5 = O(e/2 10ddim ^). Since T is net-respecting, these edges must cross S\ at I- level net points, 
where t is the maximum value such that s < e5s % . We will patch T\ with respect to these long 
edges via the minimum spanning tree of the t- level net points covering S\. There are (s/e5) 4ddim ^ 
such net points, and their net-respecting MST has weight at most (1 + 6e)(s / e5) 4 ddim ^ ss> s l < eqs % 



(see Lemma 1.11 ), which bounds, up to a constant factor, the cost of patching the long edges 
crossing S\. 

Turning to the shorter edges of T (of length at most 5s l ) crossing Si, we show how to patch 
T\ with respect to these edges. Now, these edges cross into S\ from a set of points V C S2 within 
distance h + <5s* of v. Since we have shown how to patch the long edges of T\ crossing l-level net 
points, the short edges of T\ crossing V will be patched by connecting them to £'-level net points 
(though not necessarily directly, see below), where £' is the largest value such that s e < Ss i . To 
this end, add to Si copies of all /-level and lower level net points in 52 which cover points in V, 
and let the resulting point set be Si ■ Note that we can now patch the short edges using the MST 
of the new points, and then convert the new edges to be net-respecting. 

We now bound the cost of this patching (which also includes the cost of covering the copies 
of §2 points in Si). Define the annulus A(v,ri,r2) = B(v,r2) \ B{v,ri). The patching cost 
is bounded, up to a constant factor, by the sum of the minimum spanning trees of all balls of 
radius s l centered at /-level net points inside the annulus A(v,h — 5s l ,h + 5s l ). Let N(h) de- 
note the set of all such net points, and our algorithm chooses the value for h such that the sum 
J2ueN(h) w(MST(B(u, 4£'))) is minimum. The number of /-level net points inside this annulus is 



upper bounded by (2 • Us/5) 2 ddim ( s ) < (s/<5) 4ddim ( 5 ). By Lemma pip)! for each t' -level net point 



u we have w(MST(B(u, s e '))) < w(T n B*(u, As e ')) + (s/e) 2ddim( V! Recalling that / < s\ the 
total cost of the patching for short edges is bounded by 

w(MST(B(u,£'))) < [w(TnB*(ii,4/))]+( S 6 /e 2 5 4 ) ddim(S, s J . 

u&N{h) ueN(h) 

Now, each of the balls B(u, s e ') intersects at most 2 5ddim ^ other balls centered at ^'-level net points. 
Hence, it follows easily that Y^ ueN(h) w(TnB*(u,4s e ')) < 2 5ddim ^w(TnA*(v,h-55s i ,h + 55s i )), 
where A*(v, ri, T2) is defined as all edges with both endpoints inside the annulus A(v,ri,r2). 

It remains only to show there exists an h giving a low- weight annulus, and this will in turn 
bound the cost of the patching. By an averaging argument, there is a value for h for which 
A*(v, h — 5Ss l , h + 55s*) contains edges of total weight 0(5) ■ w(TnB*(v, 13s*)). (Here, we assumed 
that I2s i + 55s i <h< ^-bds^) By Lemma |"l.l2|(i)l w(B*(v, 13s 1 ) nT) < 7w(MST(B(v, 13s 1 '))). 



To bound w(M$T(B(v, 13s'))), first recall that v was chosen to maximize w(MST(B(v, 3s 2 ))). 
Now by repeated application of the definition of doubling dimension, B(v , 13s*) can be covered by 
24ddim(5) j-jgjjg Q f rac [i us 3 S * (with centers from S), and so MST(i?(f, 13s*)) is bounded by the cost of 
constructing a minimum spanning tree inside each of these small balls and connecting them together. 
It follows that w(MST(B(v, 13s*)) < 2 Addim ^w(MST(B(v, 3s*')))+2 4ddim ( s )26s i < 2 5ddim ( s ) -3q*s\ 
Therefore we obtain 

w(Tr\B*(u,As e ')) < 0{5) .2 10ddim ( s Vs*'. 

u£N(h) 

Since 5 = 0(e/2 wddim ^), this last bound is at most eq*s\ Recalling that q > e' 1 (s 6 / e 2 5 4 ) ddim ^ s \ 
we can conclude that the total weight of all patchings (including short and long edges) is at most 
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0(eqs l ) +eq*s i + ( s 6 /e 2 <5 4 ) ddim ( s V < 0{eq*s i ). Since h > 12s\ Si contains the ball 12s*) 
and so by Lemma |L12Tii)| w(OPT NR (S 1 )) > w(MST(B(v, 3s*))) - (s/e) 2ddim ( s )3s i = 2g * - 
(s/e) 2ddim (' s ')3s' > gV, so we may conclude that the patching cost is 0{e) ■ w(OPT NR (S 1 )). 

A similar argument applies to the patchings needed for S2, by adding to S2 all t- level net points 
of Si, as well as all £'-level net points of S\ covering points of the resulting set is 5*2. This 
completes the proof of part (b) of the lemma. 

We note that by construction S\ n Si 7^ 0, and that Si 7^ S (as implied by part [b] of the 
lemma). 

We now turn to proving part 



By the choice of i, for every j < i and every net-point u G Hj, 
and so by Lemma |1.12|(i)| , w(OPT NR (S 1 ) n B*(u,3s j )) < Uqs j . By the 



MST( J B(n,3s J )) < 2qs^ 

packing property, for every net point u G H h w{OPT NR (S 1 ) n B*(u,3s i )) < i4(4 s )ddim(s) qg i-i 
0( v^log n) • qs l , which completes the lemma. 



□ 



We are now ready to prove Theorem 1.3. 



Proof of Theorem [01. Given a point set S, if S contains a single point then we are done. Otherwise, 



we use the procedure of Lemma 3.4 to create two instances of TSP, Si C S and S2 C S. Si admits 



a (/-sparse and net-respecting tour Ti as promised by Lemma 3.4. A tour of almost the same cost 
(at most 1 + e factor larger) can be computed by the algorithm of Lemma obtaining a tour 
T{, where w(T{) < (1 + e)w(Ti). Set S2 is solved inductively (that is S is recursively replaced by 
S2), obtaining a tour T' 2 . The inequality 5i D Si ^ implies that separate tours T[ and T' 2 can be 
joined together to obtain a complete tour T at no additional cost. 

We now prove inductively that w(T) < fi^f) ■ w(OPT NR (S)). By the induction hypothesis we 

have that w(T^) < (i±f ) • w(OPT NR (S 2 )). Lemma (TJ implies that 



w(T{)<(l + s)w(Ti)< ^i±£ 



u»(OPT 7Vfl (5)) - w(OPT JVK (S* 2 ))). 



Therefore 



w(T)=w(T{) + w(Ti)<w(T[) + 



1 + g 
1 - e 



w;(OPT iV - R (S , 2)) < 



1 + e 
1-e 



^(OPT^CS)), 



proving the inductive claim. Finally, by Lemma |l.ll| we have that w(T) = (l + 0(e)) -w(OPT(S)). 
The runtime follows from executing the algorithm of Lemma |3.2| on the (/'-sparse sets of Lemma 



3.4. 



□ 



4 Spanner dependent algorithm 

Here we show that the techniques presented above imply a different TSP dynamic programming 
algorithm, which has efficient runtime if doubling metrics admit light low-stretch spanners. In 
particular, if a certain conjecture holds regarding the weight of such a spanner the runtime of the 
new algorithm has significantly better dependence on the parameters n, ddim(S') and e than that 
of our TSP algorithm described in the main part of the paper. 
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4.1 Metric spanners 

A graph H is a (1 + e)-stretch spanner for graph G if H is a subgraph of G that contains all nodes 
of G (but not all edges), and dfj{u,v) < (1 + e)dc(u, v) for all u,v £ G, where (1q(u,v) (djj(u,v)) 
denotes the shortest path distance between u and uonG (H). 

Let G be the full graph of an arbitrary <i-dimensional Euclidean set S. Then there exists a 
(1 + e)-stretch spanner for G with weight W^MST(S), where W E = e" e(d) [PNS95|, |ADM + 95|, 



and the bound on We is tight. Now let G be the full graph of a doubling set S. Then there 
exists a (1 + e)-stretch spanner for G with weight WdMST(S), where Wd = e _e ( ddim ( 5 )) log n 
HSmi09[| , though it is not known if this upper bound on Wjj is tight. (Of course, the lower bound 
for Euclidean spanners also applies to the more general doubling spanners.) In fact, the following 
conjecture has been the subject of much study in the spanner community: 

Conjecture 1. Wd is independent of n; perhaps Wd = £ _e ( ddim ( s )). 

We will give an algorithm for TSP whose runtime depends on Wd- 

4.2 Hierarchy and algorithms 

In this section, we briefly sketch a proof of the following theorem: 

Theorem 4.1. There exists a randomized algorithm that with constant probability computes a 
(1 + e)- approximation to an optimal TSP metric tour in time n (logn)^ WD2 logiogn^ 



The proof proceeds along the lines of the proof for Theorem |1.3| , but makes use of the doubling 
spanner G. (As an aside, we may assumes that all edges of G are unit length, and this construction 
can be attained via the work of [|GT08|| .) Let s = 0(1). If graph G is g-sparse, then we use the 



standard TSP dynamic programming algorithm to produce a (l + eW£))-approximation the optimal 
tour on G. However, we need not guess the valid radii for a net-point; instead, we choose a random 
radius from among those that cut at most ddim(S')(7 edges of G. We can then run the standard 
dynamic programming algorithm of | Aro98| , [Tal04 1 in time 



To handle the removal of dense subsets, we find directly the lowest level i that contains a dense 
ball. Let B(u, 3s l ) be the heaviest such ball in i. We choose the radius h u S [3s l , 4s l ] which cuts the 
fewest edges of G (at most ddim(S , )g). Let S± be the points inside B(u, h u ). We remove S± from S, 



find a ddim(S')g-tour for S± using Lemma 3.2, and then convert this into a closed tour by adding 
edges of the minimum spanning tree of the cross-points. The second set S2 consists of S — Si, plus 
copies of the points of the previous minimum spanning tree. The analysis is similar to that in Section 



3.2 , and shows that the weight of the minimum spanning tree is much lighter than w(OPT(S{)). 



The final analysis gives that the weight of computed tour is 0PT(5) + ew(G) = (1 + eWo) OPT(S'), 



and Theorem 44 follows by scaling e to 
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